Ranking and Selection of Differentially Expressed Genes from Microarray Data
Abstract
This paper presents adaptive algorithms for ranking and selecting differentially expressed genes from microarray data. A ranking method originally proposed in [1] is adapted and supplemented with a Hausdorff distance-based ranking method to improve ranking performance. A weighted fusion scheme combines the 'mean' and the Hausdorff distance-based ranking methods into a single robust ranking, with the normalized consistency measure used as the fusion weight. An adaptive subspace iteration (ASI) based selection algorithm is then applied to the top-ranked genes to select highly differentially expressed genes. To illustrate the utility of the proposed algorithms, empirical analyses were conducted on both simulated data (400 simulated microarray datasets) and real microarray datasets (a colon cancer dataset and a gastric cancer dataset). The empirical analysis showed that the proposed unified approach is robust against initialization and yields consistent selection of differentially expressed genes.
Key-Words: Adaptive Subspace Iteration, Clustering, Ranking, Differentially Expressed Genes, Microarray Data Analysis.
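The abstract does not give the formulas, so the following is only an illustrative sketch of how a weighted fusion of a mean-based ranking and a Hausdorff distance-based ranking might look. Everything in it is an assumption: the function names (`hausdorff_1d`, `ranks_descending`, `fuse_rankings`), the use of the absolute difference of group means as the 'mean' score, the rank-level fusion, and the weights `w_mean` and `w_hausdorff`, which the caller supplies here in place of the paper's normalized consistency measure. The ASI selection step is not sketched.

```python
from statistics import mean


def hausdorff_1d(a, b):
    """Symmetric Hausdorff distance between two finite sets of real numbers."""
    def directed(x, y):
        # Largest distance from a point in x to its nearest point in y.
        return max(min(abs(p - q) for q in y) for p in x)
    return max(directed(a, b), directed(b, a))


def ranks_descending(scores):
    """Rank 1 goes to the largest score; ties are broken by input order."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0] * len(scores)
    for position, gene_index in enumerate(order, start=1):
        ranks[gene_index] = position
    return ranks


def fuse_rankings(control, treated, w_mean, w_hausdorff):
    """Return gene indices ordered from most to least differentially expressed.

    control, treated: per-gene lists of expression values for the two groups.
    w_mean, w_hausdorff: hypothetical non-negative weights (assumed inputs,
    standing in for a consistency-based weight that is not defined here).
    """
    total = w_mean + w_hausdorff
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    w_mean, w_hausdorff = w_mean / total, w_hausdorff / total

    # Assumed 'mean' score: absolute difference of the group means.
    mean_scores = [abs(mean(c) - mean(t)) for c, t in zip(control, treated)]
    haus_scores = [hausdorff_1d(c, t) for c, t in zip(control, treated)]

    r_mean = ranks_descending(mean_scores)
    r_haus = ranks_descending(haus_scores)

    # Weighted sum of ranks; a smaller fused value means a higher final rank.
    fused = [w_mean * m + w_hausdorff * h for m, h in zip(r_mean, r_haus)]
    return sorted(range(len(fused)), key=lambda i: fused[i])


control = [[1.0, 1.1, 0.9], [5.0, 5.2, 4.8], [2.0, 2.1, 1.9]]
treated = [[1.0, 1.2, 0.8], [9.0, 9.1, 8.9], [3.0, 3.1, 2.9]]
top_genes = fuse_rankings(control, treated, w_mean=0.6, w_hausdorff=0.4)
```

Fusing ranks rather than raw scores keeps the two methods on a common scale, which is one simple design choice among several; fusing normalized scores would be an equally plausible reading of the abstract.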
Similar resources
O-29: Differences in The Transcriptional Profiles of Human Cumulus Cells Isolated From MI and MII Oocytes of Patients with Polycystic Ovary Syndrome
Background: Polycystic ovary syndrome (PCOS) is a common endocrine and metabolic disorder in women. The abnormalities of endocrine and intra-ovarian paracrine interactions may change the microenvironment for oocyte development during the folliculogenesis process and reduce the developmental competence of oocytes in PCOS patients who are suffering from anovulatory infertility and pregnancy loss....
Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets
With advances in metagenome data mining, research has become focused on microarrays. Microarrays are datasets with a large number of genes, most of which are usually irrelevant to the output class; hence, gene selection (feature selection) is essential. Removing redundant genes increases the speed and accuracy of classification. After applying the gene se...
Extracellular exosomes and preeclampsia: a microarray-based study and functional enrichment analysis
Background: Preeclampsia (PE) is a heterogeneous pregnancy disease whose exact pathophysiology is unknown. Recently, exosomes have been indicated as a causative factor in the pathogenesis of PE. The aim of the study was to investigate microarray library data to extract the differentially expressed genes (DEGs) in PE and to perform a functional enrichment analysis to predict the rol...
The False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data
Background and Objectives: In recent years, new technologies have led to the production of large amounts of data, and in the field of biology, microarray technology has also developed dramatically. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect differentially expressed genes. In this study, the false discovery rate was investiga...
Gene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
Publication date: 2006